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mim 



1 SEQUENCE LISTING HO Crp HP 

3 (1) General Information: 
4 

5 (i) APPLICANT: Amara, Susan G 

6 Arriza, Jeffrey L 
7 

8 (ii) TITLE OF INVENTION: Amino Acid Transporters and Uses 
9 

10 (iii) NUMBER OF SEQUENCES: 17 

11 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Allegretti & Witcoff, Ltd. 

14 (B) STREET: 10 South Wacker Drive, Suite 3000 

15 (C) CITY: Chicago 

16 (D) STATE: IL 

17 (E) COUNTRY: USA 

18 (F) ZIP: 60606 
19 

2 0 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Floppy disk 

22 (B) COMPUTER: IBM PC compatible 

23 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

24 (D) SOFTWARE: Patentin Release #1.0, Version #1.25 
25 

26 (vi) CURRENT APPLICATION DATA: 

27 (A) APPLICATION NUMBER: US 08/140,729 
2 8 (B) FILING DATE: 2 0 OCT 1993 
29 (C) CLASSIFICATION: 
30 

31 (viii) ATTORNEY/AGENT INFORMATION: 

32 (A) NAME: Noonan, Kevin E 

33 (B) REGISTRATION NUMBER: 35,303 

34 (C) REFERENCE/DOCKET NUMBER: 93,509 
35 

36 (ix) TELECOMMUNICATION INFORMATION: 

37 (A) TELEPHONE: 312-715-1000 

38 (B) TELEFAX: 312-715-1234 

39 (C) TELEX: 910-221-5317 
40 
41 

42 (2) INFORMATION FOR SEQ ID NO : 1 : 
43 

44 (i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 63 base pairs 

46 (B) TYPE: nucleic acid 

47 (C) STRANDEDNESS : single 

48 (D) TOPOLOGY: linear 
49 

5 0 (ii) MOLECULE TYPE: cDNA 
51 
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INPUT SET: S7433.ra\^ 



52 
53 

54 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

55 

56 CTGRGCRATG AARATGGCAG CCAGGGCYTC ATACAGGGCT GTGCCRTCCA TGTTRATGGT 60 
57 

58 RGC 63 
59 

60 (2) INFORMATION FOR SEQ ID NO : 2 : 
61 

62 (i) SEQUENCE CHARACTERISTICS: 

63 (A) LENGTH: 1680 base pairs 

64 (B) TYPE: nucleic acid 

65 (C) STRANDEDNESS: single 

66 ( D ) TOPOLOGY : 1 inear 
67 

68 (ii) MOLECULE TYPE: cDNA 

69 

70 (ix) FEATURE: 

71 (A) NAME/KEY: 5 ' UTR 

72 (B) LOCATION: 1..30 
73 

74 (ix) FEATURE: 

75 (A) NAME/KEY: CDS 

76 (B) LOCATION: 31.. 1626 
77 

78 (ix) FEATURE: 

79 (A) NAME/KEY: 3 ' UTR 

80 (B) LOCATION: 1626.. 1680 
81 

82 

83 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

84 

8 5 CACCTCTAGC TCGGAGCGGC GTGTAGCGCC ATG GAG AAG AGC AAC GAG ACC AAC 54 

86 Met Glu Lys Ser Asn Glu Thr Asn 

87 15 
88 

8 9 GGC TAC CTT GAC AGC GCT CAG GCG GGG CCT GCG GCC GGG CCC GGA GCT 102 

90 Gly Tyr Leu Asp Ser Ala Gin Ala Gly Pro Ala Ala Gly Pro Gly Ala 

91 10 15 20 
92 

93 CCG GGG ACC GCG GCG GGA CGC GCA CGG CGT TGC GCG CGC TTC CTG CGG 150 

94 Pro Gly Thr Ala Ala Gly Arg Ala Arg Arg Cys Ala Arg Phe Leu Arg 

95 25 30 35 40 
96 

97 CGC CAA GCG CTG GTG CTG CTC ACC GTG TCC GGG GTG CTG GCG GGC GCG . 198 

98 Arg Gin Ala Leu Val Leu Leu Thr Val Ser Gly Val Leu Ala Gly Ala 

99 45 50 55 
100 

101 GGC CTG GGC GCG GCG TTG CGC GGG CTC AGC CTG AGC CGC ACG CAG GTC 246 

102 Gly Leu Gly Ala Ala Leu Arg Gly Leu Ser Leu Ser Arg Thr Gin Val 
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INPUT SET: S7433.raw 

103 60 65 70 

104 

105 ACC TAG CTG GCC TTC CCC GGC GAG ATG CTG CTC CGC ATG CTG CGC ATG 2 94 

106 Thr Tyr Leu Ala Phe Pro Gly Glu Met Leu Leu Arg Met Leu Arg Met 

107 75 80 85 
108 

109 ATC ATC CTG CCG CTG GTG GTC TGC AGC CTG GTG TCG GGC GCC GCC TCG 342 

110 lie lie Leu Pro Leu Val Val Cys Ser Leu Val Ser Gly Ala Ala Ser 

111 90 95 100 
112 

113 CTC GAT GCC AGC TGC CTC GGG CGT CTG GGC GGC ATC CGT GTC GCC TAC 3 90 

114 Leu Asp Ala Ser Cys Leu Gly Arg Leu Gly Gly lie Arg Val Ala Tyr 

115 105 110 115 120 
116 

117 TTT GGC CTC ACC ACA CTG AGT GCC TCG GCG CTC GCC GTG GCC TTG GCG 43 8 

118 Phe Gly Leu Thr Thr Leu Ser Ala Ser Ala Leu Ala Val Ala Leu Ala 

119 125 130 135 
120 

121 TTC ATC ATC AAG CCA GGA TCC GGT GCG CAG ACC CTT CAG TCC AGC GAC 486 

122 Phe lie lie Lys Pro Gly Ser Gly Ala Gin Thr Leu Gin Ser Ser Asp 

123 140 145 150 
124 

12 5 CTG GGG CTG GAG GAC TCG GGG CCT CCT CCT GTC CCC AAA GAG ACG GTG 534 

126 Leu Gly Leu Glu Asp Ser Gly Pro Pro Pro Val Pro Lys Glu Thr Val 

127 155 160 165 
128 

129 GAC TCT TTC CTC GAC CTG GCC AGA AAC CTG TTT CCC TCC AAT CTT GTG 582 

13 0 Asp Ser Phe Leu Asp Leu Ala Arg Asn Leu Phe Pro Ser Asn Leu Val 
131 170 175 180 

132 

133 GTT GCA GCT TTC CGT ACG TAT GCA ACC GAT TAT AAA GTC GTG ACC. CAG 630 

134 Val Ala Ala Phe Arg Thr Tyr Ala Thr Asp Tyr Lys Val Val Thr Gin 

135 185 190 195 200 
136 

13 7 AAC AGC AGC TCT GGA AAT GTA ACC CAT GAA AAG ATC CCC ATA GGC ACT 678 

138 Asn Ser Ser Ser Gly Asn Val Thr His Glu Lys lie Pro lie Gly Thr 

139 205 210 215 
140 

141 GAG ATA GAA GGG ATG AAC ATT TTA GGA TTG GTC CTG TTT GCT CTG GTG 726 

142 Glu He Glu Gly Met Asn He Leu Gly Leu Val Leu Phe Ala Leu Val 

143 220 225 230 
144 

145 TTA GGA GTG GCC TTA AAG AAA CTA GGC TCC GAA GGA GAA GAC CTC ATC 774 

146 Leu Gly Val Ala Leu Lys Lys Leu Gly Ser Glu Gly Glu Asp Leu He 

147 235 240 245 
148 

149 CGT TTC TTC AAT TCC CTC AAC GAG GCG ACG ATG GTG CTG GTG TCC TGG 822 

150 Arg Phe Phe Asn Ser Leu Asn Glu Ala Thr Met Val Leu Val Ser Trp 

151 250 255 260 
152 

153 ATT ATG TGG TAC GTA CCT GTG GGC ATC ATG TTC CTT GTT GGA AGC AAG 8 70 
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154 lie Met Trp Tyr Val Pro Val Gly He Met Phe Leu Val Gly Ser Lys 

155 265 270 275 280 
156 

157 ATC GTG GAA ATG AAA GAG ATC ATC GTG CTG GTG ACC AGC CTG GGG AAA 918 

158 He Val Glu Met Lys Asp He He Val Leu Val Thr Ser Leu Gly Lys 

159 285 290 295 
160 

161 TAG ATC TTC GCA TCT ATA TTG GGC CAT GTT ATT CAT GGA GGA ATT GTT 966 

162 Tyr He Phe Ala Ser He Leu Gly His Val He His Gly Gly He Val 

163 300 305 310 
164 

165 CTG CCA CTT ATT TAT TTT GTT TTC ACA CGA AAA AAC CCA TTC AGA TTC 1014 

166 Leu Pro Leu He Tyr Phe Val Phe Thr Arg Lys Asn Pro Phe Arg Phe 

167 315 320 325 
168 

169 CTC CTG GGC CTC CTC GCC CCA TTT GCG ACA GCA TTT GCT ACC TGC TCC 1062 

170 Leu Leu Gly Leu Leu Ala Pro Phe Ala Thr Ala Phe Ala Thr Cys Ser 

171 330 335 340 
172 

173 AGC TCA GCG ACC CTT CCC TCT ATG ATG AAG TGC ATT GAA GAG AAC AAT 1110 

174 Ser Ser Ala Thr Leu Pro Ser Met Met Lys Cys He Glu Glu Asn Asn 

175 345 350 355 360 
176 

177 GGT GTG GAC AAG AGG ATC AGC AGG TTT ATT CTC CCC ATC GGG GCC ACC 1158 

178 Gly Val Asp Lys Arg He Ser Arg Phe He Leu Pro He Gly Ala Thr 

179 365 370 375 
180 

181 GTG AAC ATG GAC GGA GCA GCC ATC TTC CAG TGT GTG GCC GCG GTG TTC 12 06 

182 Val Asn Met Asp Gly Ala Ala He Phe Gin Cys Val Ala Ala Val Phe 

183 380 385 390 
184 

185 ATT GCG CAA CTC AAC AAC ATA GAG CTC AAC GCA GGA CAG ATT TTC ACC 1254 

186 He Ala Gin Leu Asn Asn He Glu Leu Asn Ala Gly Gin He Phe Thr 

187 395 400 405 
188 

189 ATT CTA GTG ACT GCC ACA GCG TCC AGT GTT GGA GCA GCA GGC GTG CCA 13 02 

190 He Leu Val Thr Ala Thr Ala Ser Ser Val Gly Ala Ala Gly Val Pro 

191 410 415 420 
192 

193 GCT GGA GGG GTC CTC ACC ATT GCC ATT ATC CTG GAG GCC ATT GGG CTG 13 50 

194 Ala Gly Gly Val Leu Thr He Ala He He Leu Glu Ala He Gly Leu 

195 425 430 435 440 
196 

197 CCT ACT CAT GAC CTG CCT CTG ATC CTG GCT GTG GAC TGG ATT GTG GAC 13 98 

198 Pro Thr His Asp Leu Pro Leu He Leu Ala Val Asp Trp He Val Asp 

199 445 450 455 
200 

2 01 CGG ACC ACC ACG GTG GTG AAT GTG GAG GGG GAT GCC CTG GGT GCA GGC 1446 
2 02 Arg Thr Thr Thr Val Val Asn Val Glu Gly Asp Ala Leu Gly Ala Gly 
203 460 465 470 

204 
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205 ATT CTC CAC CAC CTG AAT CAG AAG GCA ACA AAG AAA GGC GAG CAG GAA 14 94 

2 06 lie Leu His His Leu Asn Gin Lys Ala Thr Lys Lys Gly Glu Gin Glu 
207 475 480 485 

208 

2 09 CTT GCT GAG GTG AAA GTG GAA GCC ATC CCC AAC TGC AAG TCT GAG GAG 1542 

210 Leu Ala Glu Val Lys Val Glu Ala lie Pro Asn Cys Lys Ser Glu Glu 

211 490 495 500 
212 

213 GAG ACA TCG CCC CTG GTG ACA CAC CAG AAC CCC GCT GGC CCC GTG GCC 15 90 

214 Glu Thr Ser Pro Leu Val Thr His Gin Asn Pro Ala Gly Pro Val Ala 

215 505 510 515 520 
216 

217 AGT GCC CCA GAA CTG GAA TCC AAG GAG TCG GTT CTG TGATGGGGCT 1636 

218 Ser Ala Pro Glu Leu Glu Ser Lys Glu Ser Val Leu 

219 525 530 
220 

221 GGGCTTTGGG CTTGCCTGCC AGCAGTGATG TCCCACCCTG TTCA 1680 

222 

223 

224 (2) INFORMATION FOR SEQ ID NO : 3 : 
225 

226 (i) SEQUENCE CHARACTERISTICS: 

22 7 (A) LENGTH: 532 amino acids 

228 (B) TYPE: amino acid 

22 9 (D) TOPOLOGY: linear 

230 

231 (ii) MOLECULE TYPE: protein 

232 

233 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

234 

235 Met Glu Lys Ser Asn Glu Thr Asn Gly Tyr Leu Asp Ser Ala Gin Ala 

236 1 5 10 15 
237 

238 Gly Pro Ala Ala Gly Pro Gly Ala Pro Gly Thr Ala Ala Gly Arg Ala 

239 20 25 30 
240 

241 Arg Arg Cys Ala Arg Phe Leu Arg Arg Gin Ala Leu Val Leu Leu Thr 

242 35 40 45 
243 

244 Val Ser Gly Val Leu Ala Gly Ala Gly Leu Gly Ala Ala Leu Arg Gly 

245 50 55 60 
246 

24 7 Leu Ser Leu Ser Arg Thr Gin Val Thr Tyr Leu Ala Phe Pro Gly Glu 
248 65 70 75 80 

249 

2 50 Met Leu Leu Arg Met Leu Arg Met lie lie Leu Pro Leu Val Val Cys 
251 85 90 95 

252 

253 Ser Leu Val Ser Gly Ala Ala Ser Leu Asp Ala Ser Cys Leu Gly Arg 

254 100 105 110 
255 
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256 Leu Gly Gly lie Arg Val Ala Tyr Phe Gly Leu Thr Thr Leu Ser Ala 

257 115 120 125 
258 

259 Ser Ala Leu Ala Val Ala Leu Ala Phe lie lie Lys Pro Gly Ser Gly 

260 130 135 140 
261 

262 Ala Gin Thr Leu Gin Ser Ser Asp Leu Gly Leu Glu Asp Ser Gly Pro 

263 145 150 155 160 
264 

265 Pro Pro Val Pro Lys Glu Thr Val Asp Ser Phe Leu Asp Leu Ala Arg 

266 165 170 175 
267 

268 Asn Leu Phe Pro Ser Asn Leu Val Val Ala Ala Phe Arg Thr Tyr Ala 

269 180 185 190 
270 

2 71 Thr Asp Tyr Lys Val Val Thr Gin Asn Ser Ser Ser Gly Asn Val Thr 

272 195 200 205 

273 

2 74 His Glu Lys lie Pro lie Gly Thr Glu lie Glu Gly Met Asn lie Leu 

275 210 215 220 

276 

277 Gly Leu Val Leu Phe Ala Leu Val Leu Gly Val Ala Leu Lys Lys Leu 

278 225 230 235 240 
279 

28 0 Gly Ser Glu Gly Glu Asp Leu lie Arg Phe Phe Asn Ser Leu Asn Glu 
281 245 250 255 

282 

283 Ala Thr Met Val Leu Val Ser Trp He Met Trp Tyr Val Pro Val Gly 

284 260 265 270 
285 

286 He Met Phe Leu Val Gly Ser Lys He Val Glu Met Lys Asp He He 

287 275 280 285 
288 

289 Val Leu Val Thr Ser Leu Gly Lys Tyr He Phe Ala Ser He Leu Gly 

290 290 295 300 
291 

292 His Val He His Gly Gly He Val Leu Pro Leu He Tyr Phe Val Phe 

293 305 310 315 320 
294 

2 95 Thr Arg Lys Asn Pro Phe Arg Phe Leu Leu Gly Leu Leu Ala Pro Phe 
296 325 330 335 

297 

2 98 Ala Thr Ala Phe Ala Thr Cys Ser Ser Ser Ala Thr Leu Pro Ser Met 
299 340 345 350 

300 

3 01 Met Lys Cys He Glu Glu Asn Asn Gly Val Asp Lys Arg He Ser Arg 
302 355 360 365 

303 

3 04 Phe He Leu Pro He Gly Ala Thr Val Asn Met Asp Gly Ala Ala He 

305 370 375 380 

306 
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PATENT APPLICATION US/08/140, 729A TIME: 1 1 :22:42 
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307 Phe Gin Cys Val Ala Ala Val Phe He Ala Gin Leu Asn Asn He Glu 

308 385 390 395 400 
309 

310 Leu Asn Ala Gly Gin He Phe Thr He Leu Val Thr Ala Thr Ala Ser 

311 405 410 415 
312 

313 

314 Ser Val Gly Ala Ala Gly Val Pro Ala Gly Gly Val Leu Thr He Ala 

315 420 425 430 
316 

317 He He Leu Glu Ala He Gly Leu Pro Thr His Asp Leu Pro Leu He 

318 435 440 445 
319 

320 Leu Ala Val Asp Trp He Val Asp Arg Thr Thr Thr Val Val Asn Val 

321 450 455 460 
322 

323 Glu Gly Asp Ala Leu Gly Ala Gly He Leu His His Leu Asn Gin Lys 

324 465 470 475 480 
325 

326 Ala Thr Lys Lys Gly Glu Gin Glu Leu Ala Glu Val Lys Val Glu Ala 

327 485 490 495 
328 

329 He Pro Asn Cys Lys Ser Glu Glu Glu Thr Ser Pro Leu Val Thr His 

330 500 505 510 
331 

3 32 Gin Asn Pro Ala Gly Pro Val Ala Ser Ala Pro Glu Leu Glu Ser Lys 

333 515 520 525 

334 

3 35 Glu Ser Val Leu 

336 530 

337 

338 (2) INFORMATION FOR SEQ ID NO : 4 : 
339 

340 (i) SEQUENCE CHARACTERISTICS: 

341 (A) LENGTH: 1680 base pairs 

342 (B) TYPE: nucleic acid 

343 (C) STRANDEDNESS : single 

344 (D) TOPOLOGY: linear 
345 

346 (ii) MOLECULE TYPE: cDNA 

347 

34 8 (ix) FEATURE: 

349 (A) NAME/KEY: 5 ' UTR 

350 (B) LOCATION: 1. .30 
351 

352 (ix) FEATURE: 

353 (A) NAME/KEY: CDS 

354 (B) LOCATION: 31.. 1656 
355 

356 (ix) FEATURE: 

357 (A) NAME/KEY: 3 ' UTR 
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358 






(B) LOCATION: 


1657. .1680 


















359 




































360 
361 




(xi) 


i SEQUENCE DESCRIPTION: J 


SEQ 


ID NO: 4: 














362 


AAAGAAGAGA ( 


::CCTCCTAGA AAAGTAAAAT ATG ACT AAA AGC AAT GGA GAA GAG 


54 


363 


















Met Thr Lys Ser Asn Gly Glu Glu 




364 




















1 






1 


5 








365 




































366 


CCC 


AAG 


ATG 


GGG 


GGC 


AGG 


ATG 


GAG 


AGA 


TTC 


CAG 


CAG 


GGA 


GTC 


CGT 


AAA 


102 


367 


Pro 


Lys 


Met 


Gly 


Gly 


Arg 


Met 


Glu 


Arg 


Phe 


Gin 


Gin 


Gly 


Val 




J— 1^ o 




368 




10 










15 










20 












369 




































370 


CGC 


ACA 


CTT 


TTG 


GCC 


AAG 


AAG 


AAA 


GTG 


CAG 


AAC 


ATT 


ACA 


AAG 


GAG 


GTT 


150 


371 


Arg 


Thr 


Leu 


Leu 


Ala 


Lys 


Lys 


Lys 


Val 


Gin 


Asn 


He 


Thr 


Lys 


Glu 


VAX 




372 


25 










30 










35 










40 




373 




































374 


GTT 


AAA 


AGT 


TAC 


CTG 


TTT 


CGG 


AAT 


GCT 


TTT 


GTG 


CTG 


CTC 


ACA 


GTC 


ACC 


198 


375 


Val 


Lys 


Ser 


Tyr 


Leu 


Phe 


Arcr 


Asn 


Ala 


Phe 


Val 


Leu 


Leu 


Thr 


Val 


Thr 




376 










45 










50 










55 






377 




































378 


GCT 


GTC 


ATT 


GTG 


GGT 


ACA 


ATC 


CTT 


GGA 


TTT 


ACC 


CTC 


CGA 


CCA 


TAC 


AGA 


246 


379 


Ala 


Val 


He 


Val 


Gly 


Thr 


He 


Leu 


Gly 


Phe 


Thr 


Leu 


Arg 


Pro 


Tyr 

X _y J- 


Arg 




380 








60 










65 










70 








381 




































382 


ATG 


AGC 


TAC 


CGG 


GAA 


GTC 


AAG 


TAC 


TTC 


TCC 


TTT 


CCT 


GGG 


GAA 


CTT 


CTG 


294 


383 


Met 


Ser 


Tyr 


Arg 


Glu 


Val 


Lys 


Tyr 


Phe 


Ser 


Phe 


Pro 


Gly 


Glu 


Leu 


Leu 




384 






75 










80 










85 










385 




































386 


ATG 


AGG 


ATG 


TTA 


CAG 


ATG 


CTG 


GTC 


TTA 


CCA 


CTT 


ATC 


ATC 


TCC 


AGT 


CTT 


342 


387 


Met 


Arg 


Met 


Leu 


Gin 


Met 


Leu 


Val 


Leu 


Pro 


Leu 


He 


He 


Ser 


Ser 


Leu 




388 




90 










95 










100 












389 




































390 


GTC 


ACA 


GGA 


ATG 


GCG 


GCG 


CTA 


GAT 


AGT 


AAG 


GCA 


TCA 


GGG 


AAG 


TGG 


GAA 


390 


391 


Val 


Thr 


Gly 


Met 


Ala 


Ala 


Leu 


Asp 


Ser 


Lys 


Ala 


Ser 


Gly 


Lys 


Trp 


Glu 




392 


105 










110 










115 










120 




393 




































394 


TGC 


GGA 


GCT 


GTA 


GTC 


TAT 


TAT 


ATG 


ACT 


ACC 


ACC 


ATC 


ATT 


GCT 


GTG 


GTG 


438 


395 


Cys 


Gly 


Ala 


Val 


Val 


Tyr 


Tyr 


Met 


Thr 


Thr 


Thr 


He 


He 


Ala 


Val 


Val 




396 










125 










130 










135 






397 




































398 


ATT 


GGC 


ATA 


ATC 


ATT 


GTC 


ATC 


ATC 


ATC 


CAT 


CCT 


GGG 


AAG 


GGC 


ACA 


AAG 


486 


399 


He 


Gly 


He 


He 


He 


Val 


He 


He 


He 


His 


Pro 


Gly 


Lys 


Gly 


Thr 


Lys 




400 








140 










145 










150 








401 




































402 


GAA 


AAC 


ATG 


CAC 


AGA 


GAA 


GGC 


AAA 


ATT 


GTA 


CGA 


GTG 


ACA 


GCT 


GCA 


GAT 


534 


403 


Glu 


Asn 


Met 


His 


Arg 


Glu 


Gly 


Lys 


He 


Val 


Arg 


Val 


Thr 


Ala 


Ala 


Asp 




4 04 






155 










160 










165 










405 




































406 


GCC 


TTC 


CTG 


GAC 


TTG 


ATC 


AGG 


AAC 


ATG 


TTA 


AAT 


CCA 


AAT 


CTG 


GTA 


GAA 


582 


407 


Ala 


Phe 


Leu 


Asp 


Leu 


He 


Arg 


Asn 


Met 


Leu 


Asn 


Pro 


Asn 


Leu 


Val 


Glu 




408 




170 










175 










180 
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409 

410 GCC TGC TTT AAA CAG TTT AAA ACC AAC TAT GAG AAG AGA AGC TTT AAA 63 0 

411 Ala Cys Phe Lys Gin Phe Lys Thr Asn Tyr Glu Lys Arg Ser Phe Lys 

412 185 190 195 200 
413 

414 GTG CCC ATC CAG GCC AAC GAA ACG CTT GTG GGT GCT GTG ATA AAC AAT 678 

415 Val Pro lie Gin Ala Asn Glu Thr Leu Val Gly Ala Val lie Asn Asn 

416 205 210 215 
417 

418 GTG TCT GAG GCC ATG GAG ACT CTT ACC CGA ATC ACA GAG GAG CTG GTC 726 

419 Val Ser Glu Ala Met Glu Thr Leu Thr Arg lie Thr Glu Glu Leu Val 

420 220 225 230 
421 

422 CCA GTT CCA GGA TCT GTG AAT GGA GTC AAT GCC CTG GGT CTA GTT GTC 774 

423 Pro Val Pro Gly Ser Val Asn Gly Val Asn Ala Leu Gly Leu Val Val 

424 235 240 245 
425 

426 TTC TCC ATG TGC TTC GGT TTT GTG ATT GGA AAC ATG AAG GAA CAG GGG 822 

42 7 Phe Ser Met Cys Phe Gly Phe Val lie Gly Asn Met Lys Glu Gin Gly 
428 250 255 260 

429 

43 0 CAG GCC CTG AGA GAG TTC TTT GAT TCT CTT AAC GAA GCC ATC ATG AGA 870 

431 Gin Ala Leu Arg Glu Phe Phe Asp Ser Leu Asn Glu Ala lie Met Arg 

432 265 270 275 280 
433 

434 CTG GTA GCA GTA ATA ATG TGG TAT GCC CCC GTG GGT ATT CTC TTC CTG 918 

435 Leu Val Ala Val He Met Trp Tyr Ala Pro Val Gly He Leu Phe Leu 

436 285 290 295 
437 

43 8 ATT GCT GGG AAG ATT GTG GAG ATG GAA GAC ATG GGT GTG ATT GGG GGG 966 

43 9 He Ala Gly Lys He Val Glu Met Glu Asp Met Gly Val He Gly Gly 
440 300 305 310 

441 

442 CAG CTT GCC ATG TAC ACC GTG ACT GTC ATT GTT GGC TTA CTC ATT CAC 1014 

443 Gin Leu Ala Met Tyr Thr Val Thr Val He Val Gly Leu Leu He His 

444 315 320 325 
445 

446 GCA GTC ATC GTC TTG CCA CTC CTC TAC TTC TTG GTA ACA CGG AAA AAC 1062 

44 7 Ala Val He Val Leu Pro Leu Leu Tyr Phe Leu Val Thr Arg Lys Asn 
448 330 335 340 

449 

450 CCT TGG GTT TTT ATT GGA GGG TTG CTG CAA GCA CTC ATC ACC GCT CTG 1110 

451 Pro Trp Val Phe He Gly Gly Leu Leu Gin Ala Leu He Thr Ala Leu 

452 345 350 355 360 
453 

454 GGG ACC TCT TCA AGT TCT GCC ACC CTA CCC ATC ACC TTC AAG TGC CTG 1158 

455 Gly Thr Ser Ser Ser Ser Ala Thr Leu Pro He Thr Phe Lys Cys Leu 

456 365 370 375 
457 

458 GAA GAG AAC AAT GGC GTG GAC AAG CGC GTC ACC AGA TTC GTG CTC CCC 12 06 

45 9 Glu Glu Asn Asn Gly Val Asp Lys Arg Val Thr Arg Phe Val Leu Pro 
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460 380 385 390 

461 

462 GTA GGA GCC ACC ATT AAC ATG GAT GGG ACT GCC CTC TAT GAG GCT TTG 1254 

463 Val Gly Ala Thr lie Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Leu 

464 395 400 405 
465 

466 GCT GCC ATT TTC ATT GCT CAA GTT AAC AAC TTT GAA CTG AAC TTC GGA 13 02 

467 Ala Ala lie Phe lie Ala Gin Val Asn Asn Phe Glu Leu Asn Phe Gly 

468 410 415 420 
469 

470 CAA ATT ATT ACA ATC AGC ATC ACA GCC ACA GCT GCC AGT ATT GGG GCA 13 50 

471 Gin He He Thr He Ser He Thr Ala Thr Ala Ala Ser He Gly Ala 

472 425 430 435 440 
473 

474 GCT GGA ATT CCT CAG GCG GGC CTG GTC ACT ATG GTC ATT GTG CTG ACA 13 98 

475 Ala Gly He Pro Gin Ala Gly Leu Val Thr Met Val He Val Leu Thr 

476 445 450 455 
477 

478 TCT GTC GGC CTG CCC ACT GAC GAC ATC ACG CTC ATC ATC GCG GTG GAC 1446 

479 Ser Val Gly Leu Pro Thr Asp Asp He Thr Leu He He Ala Val Asp 

480 460 465 470 
481 

482 TGG TTC TTG GAT CGC CTC CGG ACC ACC ACC AAC GTA CTG GGA GAC TCC 14 94 

483 Trp Phe Leu Asp Arg Leu Arg Thr Thr Thr Asn Val Leu Gly Asp Ser 

484 475 480 485 
485 

486 CTG GGA GCT GGG ATT GTG GAG CAC TTG TCA CGA CAT GAA CTG AAG AAC 1542 

487 Leu Gly Ala Gly He Val Glu His Leu Ser Arg His Glu Leu Lys Asn 

488 490 495 500 
489 

490 AGA GAT GTT GAA ATG GGT AAC TCA GTG ATT GAA GAG AAT GAA ATG AAG 1590 

491 Arg Asp Val Glu Met Gly Asn Ser Val He Glu Glu Asn Glu Met Lys 

492 505 510 515 520 
493 

494 AAA CCA TAT CAA CTG ATT GCA CAG GAC AAT GAA ACT GAG AAA CCC ATC 1638 

495 Lys Pro Tyr Gin Leu He Ala Gin Asp Asn Glu Thr Glu Lys Pro He 

496 525 530 535 
497 

4 98 GAC AGT GAA ACC AAG ATG TAGACTAACA TAAAGAAACA CTTT 1680 

4 99 Asp Ser Glu Thr Lys Met 

500 540 

501 

502 

503 (2) INFORMATION FOR SEQ ID NO : 5 : 
504 

505 (i) SEQUENCE CHARACTERISTICS: 

506 (A) LENGTH: 542 amino acids 

507 (B) TYPE: amino acid 

508 (D) TOPOLOGY: linear 
509 

510 (ii) MOLECULE TYPE: protein 



PAGE: 1 1 RAW SEQUENCE LISTING DATE: 03/04/94 

PATENT APPLICATION US/08/140, 729 A TIME: 1 1 :23:08 



INPUT SET: S7433.raw 



511 

512 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

513 

514 Met Thr Lys Ser Asn Gly Glu Glu Pro Lys Met Gly Gly Arg Met Glu 

515 15 10 15 
516 

517 Arg Phe Gin Gin Gly Val Arg Lys Arg Thr Leu Leu Ala Lys Lys Lys 

518 20 25 30 
519 

52 0 Val Gin Asn lie Thr Lys Glu Val Val Lys Ser Tyr Leu Phe Arg Asn 

521 35 40 45 

522 

523 Ala Phe Val Leu Leu Thr Val Thr Ala Val He Val Gly Thr He Leu 

524 50 55 60 
525 

526 Gly Phe Thr Leu Arg Pro Tyr Arg Met Ser Tyr Arg Glu Val Lys Tyr 

527 65 70 75 80 
528 

52 9 Phe Ser Phe Pro Gly Glu Leu Leu Met Arg Met Leu Gin Met Leu Val 
530 85 90 95 
531 

532 Leu Pro Leu He He Ser Ser Leu Val Thr Gly Met Ala Ala Leu Asp 

533 100 105 110 
'534 

53 5 Ser Lys Ala Ser Gly Lys Trp Glu Cys Gly Ala Val Val Tyr Tyr Met 
536 115 120 125 

537 

538 Thr Thr Thr He He Ala Val Val He Gly He He He Val He He 

539 130 135 140 
540 

541 He His Pro Gly Lys Gly Thr Lys Glu Asn Met His Arg Glu Gly Lys 

542 145 150 155 160 
543 

544 He Val Arg Val Thr Ala Ala Asp Ala Phe Leu Asp Leu He Arg Asn 

545 165 170 175 
546 

547 Met Leu Asn Pro Asn Leu Val Glu Ala Cys Phe Lys Gin Phe Lys Thr 

548 180 185 190 
549 

550 Asn Tyr Glu Lys Arg Ser Phe Lys Val Pro He Gin Ala Asn Glu Thr 

551 195 200 205 
552 

553 Leu Val Gly Ala Val He Asn Asn Val Ser Glu Ala Met Glu Thr Leu 

554 210 215 220 
555 

556 Thr Arg He Thr Glu Glu Leu Val Pro Val Pro Gly Ser Val Asn Gly 

557 225 230 235 240 
558 

559 Val Asn Ala Leu Gly Leu Val Val Phe Ser Met Cys Phe Gly Phe Val 

560 245 250 255 
561 
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562 lie Gly Asn Met Lys Glu Gin Gly Gin Ala Leu Arg Glu Phe Phe Asp 

563 260 265 270 
564 

565 Ser Leu Asn Glu Ala lie Met Arg Leu Val Ala Val lie Met Trp Tyr 

566 275 280 285 
567 

568 Ala Pro Val Gly lie Leu Phe Leu lie Ala Gly Lys lie Val Glu Met 

569 290 295 300 
570 

571 Glu Asp Met Gly Val lie Gly Gly Gin Leu Ala Met Tyr Thr Val Thr 

572 305 310 315 320 
573 

574 

575 Val lie Val Gly Leu Leu lie His Ala Val He Val Leu Pro Leu Leu 

576 325 330 335 
577 

578 Tyr Phe Leu Val Thr Arg Lys Asn Pro Trp Val Phe He Gly Gly Leu 

579 340 345 350 
580 

581 Leu Gin Ala Leu He Thr Ala Leu Gly Thr Ser Ser Ser Ser Ala Thr 

582 355 360 365 
583 

584 Leu Pro He Thr Phe Lys Cys Leu Glu Glu Asn Asn Gly Val Asp Lys 

585 370 375 380 
586 

587 Arg Val Thr Arg Phe Val Leu Pro Val Gly Ala Thr He Asn Met Asp 

588 385 390 395 400 
589 

590 Gly Thr Ala Leu Tyr Glu Ala Leu Ala Ala He Phe He Ala Gin Val 

591 405 410 415 
592 

593 Asn Asn Phe Glu Leu Asn Phe Gly Gin He He Thr He Ser He Thr 

594 420 425 430 
595 

596 Ala Thr Ala Ala Ser He Gly Ala Ala Gly He Pro Gin Ala Gly Leu 

597 435 440 445 
598 

599 Val Thr Met Val He Val Leu Thr Ser Val Gly Leu Pro Thr Asp Asp 

600 450 455 460 
601 

602 He Thr Leu He He Ala Val Asp Trp Phe Leu Asp Arg Leu Arg Thr 

603 465 470 475 480 
604 

605 Thr Thr Asn Val Leu Gly Asp Ser Leu Gly Ala Gly He Val Glu His 

606 485 490 495 
607 

608 Leu Ser Arg His Glu Leu Lys Asn Arg Asp Val Glu Met Gly Asn Ser 

609 500 505 510 
610 

611 Val He Glu Glu Asn Glu Met Lys Lys Pro Tyr Gin Leu He Ala Gin 

612 515 520 525 
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613 

614 Asp Asn Glu Thr Glu Lys Pro lie Asp Ser Glu Thr Lys Met 

615 530 535 540 
616 

617 

618 (2) INFORMATION FOR SEQ ID NO : 6 : 
619 

620 (i) SEQUENCE CHARACTERISTICS: 

621 (A) LENGTH: 1800 base pairs 

622 (B) TYPE: nucleic acid 

623 (C) STRANDEDNESS : single 

624 (D) TOPOLOGY: linear 
625 

626 (ii) MOLECULE TYPE: cDNA 

627 

62 8 (ix) FEATURE: 

62 9 (A) NAME/KEY: 5 ' UTR 
630 (B) LOCATION: 1..33 
631 

632 (ix) FEATURE: 

63 3 (A) NAME/KEY: CDS 

634 (B) LOCATION: 34. .1755 

635 

636 (ix) FEATURE: 

63 7 (A) NAME/KEY: 3 ' UTR 

638 (B) LOCATION: 1756.. 1800 

639 

64 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
641 

642 GATAGTGCTG AAGAGGAGGG GCGTTCCCAG ACC ATG GCA TCT ACG GAA GGT GCC 54 

643 Met Ala Ser Thr Glu Gly Ala 

644 1 5 
645 

646 AAC AAT ATG CCC AAG CAG GTG GAA GTG CGA ATG CCA GAC AGT CAT CTT 102 

647 Asn Asn Met Pro Lys Gin Val Glu Val Arg Met Pro Asp Ser His Leu 

648 10 15 20 
649 

650 GGC TCA GAG GAA CCC AAG CAC CGG CAC CTG GGC CTG CGC CTG TGT GAC 150 

651 Gly Ser Glu Glu Pro Lys His Arg His Leu Gly Leu Arg Leu Cys Asp 

652 25 30 35 
653 

654 AAG CTG GGG AAG AAT CTG CTG CTC ACC CTG ACG GTG TTT GGT GTC ATC 198 
6 55 Lys Leu Gly Lys Asn Leu Leu Leu Thr Leu Thr Val Phe Gly Val lie 
656 ■ 40 45 50 55 

657 

6 58 CTG GGA GCA GTG TGT GGA GGG CTT CTT CGC TTG GCA TCT CCC ATC CAC 246 

659 Leu Gly Ala Val Cys Gly Gly Leu Leu Arg Leu Ala Ser Pro lie His 

660 60 65 70 
661 

662 CCT GAT GTG GTT ATG TTA ATA GCC TTC CCA GGG GAT ATA CTC ATG AGG 2 94 

663 Pro Asp Val Val Met Leu lie Ala Phe Pro Gly Asp lie Leu Met Arg 
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INPUT SET: S7433.raw 

664 75 80 85 

665 

666 ATG CTA AAA ATG CTC ATT CTG GOT CTA ATC ATC TCC AGC TTA ATC ACA 342 

667 Met Leu Lys Met Leu lie Leu Gly Leu lie lie Ser Ser Leu lie Thr 

668 90 95 100 
669 

670 GGG TTG TCA GGC CTG GAT GCT AAG GCT AGT GGC CGC TTG GGC ACG AGA 3 90 

671 Gly Leu Ser Gly Leu Asp Ala Lys Ala Ser Gly Arg Leu Gly Thr Arg 

672 105 110 115 
673 

674 GCC ATG GTG TAT TAG ATG TCC ACG ACC ATC ATT GCT GCA GTA CTG GGG 438 

675 Ala Met Val Tyr Tyr Met Ser Thr Thr lie He Ala Ala Val Leu Gly 

676 120 125 130 135 
677 

678 GTC ATT CTG GTC TTG GCT ATC CAT CCA GGC AAT CCC AAG CTC AAG AAG 486 

679 Val He Leu Val Leu Ala He His Pro Gly Asn Pro Lys Leu Lys Lys 

680 140 145 150 
681 

682 CAG CTG GGG CCT GGG AAG AAG AAT GAT GAA GTG TCC AGC CTG GAT GCC 534 

683 Gin Leu Gly Pro Gly Lys Lys Asn Asp Glu Val Ser Ser Leu Asp Ala 

684 155 160 165 
685 

686 TTC CTG GAC CTT ATT CGA AAT CTC TTC CCT GAA AAC CTT GTC CAA GCC 582 
68 7 Phe Leu Asp Leu He Arg Asn Leu Phe Pro Glu Asn Leu Val Gin Ala 
688 170 175 180 

689 

690 TGC TTT CAA CAG ATT CAA ACA GTG ACG AAG AAA GTC CTG GTT GCA CCA 63 0 

691 Cys Phe Gin Gin He Gin Thr Val Thr Lys Lys Val Leu Val Ala Pro 

692 185 190 195 
693 

694 CCG CCA GAC GAG GAG GCC AAC GCA ACC AGC GCT GAA GTC TCT CTG TTG 6 78 

695 Pro Pro Asp Glu Glu Ala Asn Ala Thr Ser Ala Glu Val Ser Leu Leu 

696 200 205 210 215 
697 

698 AAC GAG ACT GTG ACT GAG GTG CCG GAG GAG ACT AAG ATG GTT ATC AAG 726 

699 Asn Glu Thr Val Thr Glu Val Pro Glu Glu Thr Lys Met Val He Lys 

700 220 225 230 
701 

702 AAG GGC CTG GAG TTC AAG GAT GGG ATG AAC GTC TTA GGT CTG ATA GGG 774 

703 Lys Gly Leu Glu Phe Lys Asp Gly Met Asn Val Leu Gly Leu He Gly 

704 235 240 245 
705 

706 TTT TTC ATT GCT TTT GGC ATC GCT ATG GGG AAG ATG GGA GAT CAG GCC 822 

707 Phe Phe He Ala Phe Gly He Ala Met Gly Lys Met Gly Asp Gin Ala 

708 250 255 260 
709 

710 AAG CTG ATG GTG GAT TTC TTC AAC ATT TTG AAT GAG ATT GTA ATG AAG 8 70 

711 Lys Leu Met Val Asp Phe Phe Asn He Leu Asn Glu He Val Met Lys 

712 265 270 275 
713 

714 TTA GTG ATC ATG ATC ATG TGG TAC TCT CCC CTG GGT ATC GCC TGC CTG 918 
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715 Leu Val lie Met lie Met Trp Tyr Ser Pro Leu Gly lie Ala Cys Leu 

716 280 285 290 295 
717 

718 ATC TGT GGA AAG ATC ATT GCA ATC AAG GAG TTA GAA GTG GTT GCT AGG 966 

719 lie Cys Gly Lys lie lie Ala lie Lys Asp Leu Glu Val Val Ala Arg 

720 300 305 310 
721 

722 CAA CTG GGG ATG TAG ATG GTA ACA GTG ATC ATA GGC CTC ATC ATC CAC 1014 

723 Gin Leu Gly Met Tyr Met Val Thr Val He He Gly Leu He He His 

724 315 320 325 
725 

726 GGG GGC ATC TTT CTC CCC TTG ATT TAC TTT GTA GTG ACC AGG AAA AAC 1062 

72 7 Gly Gly He Phe Leu Pro Leu He Tyr Phe Val Val Thr Arg Lys Asn 
728 330 335 340 

729 

73 0 CCC TTC TCC CTT TTT GCT GGC ATT TTC CAA GCT TGG ATC ACT GCC CTG 1110 

731 Pro Phe Ser Leu Phe Ala Gly He Phe Gin Ala Trp He Thr Ala Leu 

732 345 350 355 
733 

734 GGC ACC GCT TCC AGT GCT GGA ACT TTG CCT GTC ACC TTT CGT TGC CTG 1158 
73 5 Gly Thr Ala Ser Ser Ala Gly Thr Leu Pro Val Thr Phe Arg Cys Leu 
736 360 365 370 375 

737 

738 GAA GAA AAT CTG GGG ATT GAT AAG CGT GTG ACT AGA TTC GTC CTT CCT 12 06 

73 9 Glu Glu Asn Leu Gly He Asp Lys Arg Val Thr Arg Phe Val Leu Pro 
740 380 385 390 

741 

742 GTT GGA GCA ACC ATT AAC ATG GAT GGT ACA GCC CTT TAT GAA GCG GTG 12 54 

743 Val Gly Ala Thr He Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Val 

744 395 400 405 
745 

746 GCC GCC ATC TTT ATA GCC CAA ATG AAT GGT GTT GTC CTG GAT GGA GGA 13 02 

747 Ala Ala He Phe He Ala Gin Met Asn Gly Val Val Leu Asp Gly Gly 

748 410 415 420 
749 

750 CAG ATT GTG ACT GTA AGC CTC ACA GCC ACC CTG GCA AGC GTC GGC GCG 1350 

751 Gin He Val Thr Val Ser Leu Thr Ala Thr Leu Ala Ser Val Gly Ala 

752 425 430 435 
753 

754 GCC AGT ATC CCC AGT GCC GGG CTG GTC ACC ATG CTC CTC ATT CTG ACA 13 98 

755 Ala Ser He Pro Ser Ala Gly Leu Val Thr Met Leu Leu He Leu Thr 

756 440 445 450 455 
757 

758 GCC GTG GGC CTG CCA ACA GAG GAC ATC AGC TTG CTG GTG GCT GTG GAC 1446 

759 Ala Val Gly Leu Pro Thr Glu Asp He Ser Leu Leu Val Ala Val Asp 

760 460 465 470 
761 

762 TGG CTG CTG GAC AGG ATG AGA ACT TCA GTC AAT GTT GTG GGT GAC TCT 1494 

763 Trp Leu Leu Asp Arg Met Arg Thr Ser Val Asn Val Val Gly Asp Ser 

764 475 480 485 
765 
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766 TTT GGG GCT GGG ATA GTC TAT CAC CTC TCC AAG TCT GAG CTG GAT ACC 1542 

767 Phe Gly Ala Gly lie Val Tyr His Leu Ser Lys Ser Glu Leu Asp Thr 

768 490 495 500 
769 

770 ATT GAG TCC CAG CAT CGA GTG CAT GAA GAT ATT GAA ATG ACC AAG ACT 1590 

771 lie Asp Ser Gin His Arg Val His Glu Asp lie Glu Met Thr Lys Thr 

772 505 510 515 
773 

774 CAA TCC ATT TAT GAT GAC ATG AAG AAC CAC AGG GAA AGC AAC TCT AAT 1638 

775 Gin Ser lie Tyr Asp Asp Met Lys Asn His Arg Glu Ser Asn Ser Asn 

776 520 525 530 535 
777 

778 CAA TGT GTC TAT GCT GCA CAC AAC TCT GTC ATA GTA GAT GAA TGC AAG 1686 

779 Gin Cys Val Tyr Ala Ala His Asn Ser Val lie Val Asp Glu Cys Lys 

780 540 545 550 
781 

782 GTA ACT CTG GCA GCC AAT GGA AAG TCA GCC GAC TGC AGT GTT GAG GAA 1734 

783 Val Thr Leu Ala Ala Asn Gly Lys Ser Ala Asp Cys Ser Val Glu Glu 

784 555 560 565 
785 

786 GAA CCT TGG AAA CGT GAG AAA TAAGGATATG AGTCTCAGCA AATTCTTGAA 1785 

787 Glu Pro Trp Lys Arg Glu Lys 

788 570 
789 

790 TAAACTCCCC AGCGT 1800 

791 

792 

793 (2) INFORMATION FOR SEQ ID NO : 7 : 
794 

795 (i) SEQUENCE CHARACTERISTICS: 

796 (A) LENGTH: 574 amino acids 

797 (B) TYPE: amino acid 

798 (D) TOPOLOGY: linear 
799 

800 (ii) MOLECULE TYPE: protein 

801 

802 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

803 

8 04 Met Ala Ser Thr Glu Gly Ala Asn Asn Met Pro Lys Gin Val Glu Val 
805 15 10 15 

806 

807 Arg Met Pro Asp Ser His Leu Gly Ser Glu Glu Pro Lys His Arg His 

808 20 25 30 
809 

810 Leu Gly Leu Arg Leu Cys Asp Lys Leu Gly Lys Asn Leu Leu Leu Thr 

811 35 40 45 
812 

813 Leu Thr Val Phe Gly Val lie Leu Gly Ala Val Cys Gly Gly Leu Leu 

814 50 55 60 
815 

816 Arg Leu Ala Ser Pro He His Pro Asp Val Val Met Leu He Ala Phe 
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817 65 70 75 80 

818 

819 Pro Gly Asp lie Leu Met Arg Met Leu Lys Met Leu lie Leu Gly Leu 

820 85 90 95 
821 

822 lie lie Ser Ser Leu lie Thr Gly Leu Ser Gly Leu Asp Ala Lys Ala 

823 100 105 110 
824 

82 5 Ser Gly Arg Leu Gly Thr Arg Ala Met Val Tyr Tyr Met Ser Thr Thr 

826 115 120 125 

827 

82 8 lie lie Ala Ala Val Leu Gly Val He Leu Val Leu Ala He His Pro 
829 130 135 140 

830 

831 Gly Asn Pro Lys Leu Lys Lys Gin Leu Gly Pro Gly Lys Lys Asn Asp 

832 145 150 155 160 
833 

834 Glu Val Ser Ser Leu Asp Ala Phe Leu Asp Leu He Arg Asn Leu Phe 

835 165 170 175 
836 

83 7 Pro Glu Asn Leu Val Gin Ala Cys Phe Gin Gin He Gin Thr Val Thr 
838 180 185 190 

839 

84 0 Lys Lys Val Leu Val Ala Pro Pro Pro Asp Glu Glu Ala Asn Ala Thr 
841 195 200 205 

842 

843 Ser Ala Glu Val Ser Leu Leu Asn Glu Thr Val Thr Glu Val Pro Glu 

844 210 215 220 
845 

846 Glu Thr Lys Met Val He Lys Lys Gly Leu Glu Phe Lys Asp Gly Met 

847 225 230 235 240 
848 

84 9 Asn Val Leu Gly Leu He Gly Phe Phe He Ala Phe Gly He Ala Met 
850 245 250 255 

851 

852 Gly Lys Met Gly Asp Gin Ala Lys Leu Met Val Asp Phe Phe Asn He 

853 260 265 270 
854 

855 Leu Asn Glu He Val Met Lys Leu Val He Met He Met Trp Tyr Ser 

856 275 280 285 
857 

858 Pro Leu Gly He Ala Cys Leu He Cys Gly Lys He He Ala He Lys 

859 290 295 300 
860 

861 Asp Leu Glu Val Val Ala Arg Gin Leu Gly Met Tyr Met Val Thr Val 

862 305 310 315 320 
863 

864 He He Gly Leu He He His Gly Gly He Phe Leu Pro Leu He Tyr 

865 325 330 335 
866 

867 Phe Val Val Thr Arg Lys Asn Pro Phe Ser Leu Phe Ala Gly He Phe 
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868 340 345 350 

869 

870 Gin Ala Trp lie Thr Ala Leu Gly Thr Ala Ser Ser Ala Gly Thr Leu 

871 355 360 365 
872 

873 

874 Pro Val Thr Phe Arg Cys Leu Glu Glu Asn Leu Gly lie Asp Lys Arg 

875 370 375 380 
876 

877 Val Thr Arg Phe Val Leu Pro Val Gly Ala Thr He Asn Met Asp Gly 

878 385 390 395 400 
879 

880 Thr Ala Leu Tyr Glu Ala Val Ala Ala He Phe He Ala Gin Met Asn 

881 405 410 415 
882 

883 Gly Val Val Leu Asp Gly Gly Gin He Val Thr Val Ser Leu Thr Ala 

884 420 425 430 
885 

886 Thr Leu Ala Ser Val Gly Ala Ala Ser He Pro Ser Ala Gly Leu Val 

887 435 440 445 
888 

889 Thr Met Leu Leu He Leu Thr Ala Val Gly Leu Pro Thr Glu Asp He 

890 450 455 460 
891 

8 92 Ser Leu Leu Val Ala Val Asp Trp Leu Leu Asp Arg Met Arg Thr Ser 
893 465 470 475 480 

894 

8 95 Val Asn Val Val Gly Asp Ser Phe Gly Ala Gly He Val Tyr His Leu 
896 485 490 495 

897 

8 98 Ser Lys Ser Glu Leu Asp Thr He Asp Ser Gin His Arg Val His Glu 

899 500 505 510 

900 

901 Asp He Glu Met Thr Lys Thr Gin Ser He Tyr Asp Asp Met Lys Asn 

902 515 520 525 
903 

904 His Arg Glu Ser Asn Ser Asn Gin Cys Val Tyr Ala Ala His Asn Ser 

905 530 535 540 
906 

907 Val He Val Asp Glu Cys Lys Val Thr Leu Ala Ala Asn Gly Lys Ser 

908 545 550 555 560 
909 

910 Ala Asp Cys Ser Val Glu Glu Glu Pro Trp Lys Arg Glu Lys 

911 565 570 
912 

913 (2) INFORMATION FOR SEQ ID NO : 8 : 
914 

915 (i) SEQUENCE CHARACTERISTICS: 

. 916 (A) LENGTH: 16 74 base pairs 

917 (B) TYPE: nucleic acid 

918 (C) STRANDEDNESS : single . 
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919 (D) TOPOLOGY: linear 

920 

921 (ii) MOLECULE TYPE: cDNA 

922 

923 (ix) FEATURE: 

924 (A) NAME/KEY: 5 ' UTR 

925 (B) LOCATION: 1 . . 15 
926 

927 (ix) FEATURE: 

928 (A) NAME/KEY: CDS 

929 (B) LOCATION: 16.. 1590 
930 

931 (ix) FEATURE: 

932 (A) NAME/KEY: 3 ' UTR 

933 (B) LOCATION: 1591.. 1674 
934 

935 

936 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

937 

93 8 ATAGCGGCGA CAGCC ATG GGG AAA CCG GCG AGG AAA GGA TGC CCG AGT TGG 51 
93 9 Met Gly Lys Pro Ala Arg Lys Gly Cys Pro Ser Trp 

940 1 5 10 

941 

942 AAG CGC TTC CTG AAG AAT AAC TGG GTG TTG CTG TCC ACC GTG GCC GCG 99 

943 Lys Arg Phe Leu Lys Asn Asn Trp Val Leu Leu Ser Thr Val Ala Ala 

944 15 20 25 
945 

946 GTG GTG CTA GGC ATT ACC ACA GGA GTC TTG GTT CGA GAA CAC AGC AAC 147 

947 Val Val Leu Gly lie Thr Thr Gly Val Leu Val Arg Glu His Ser Asn 

948 30 35 40 
949 

950 CTC TCA ACT CTA GAG AAA TTC TAC TTT GCT TTT CCT GGA GAA ATT CTA 195 

951 Leu Ser Thr Leu Glu Lys Phe Tyr Phe Ala Phe Pro Gly Glu lie Leu 

952 45 50 55 60 
953 

954 ATG CGG ATG CTG AAA CTC ATC ATT TTG CCA TTA ATT ATA TCC AGC ATG 243 

955 Met Arg Met Leu Lys Leu lie lie Leu Pro Leu lie lie Ser Ser Met 

956 65 70 75 
957 

958 ATT ACA GGT GTT GCT GCA CTG GAT TCC AAC GTA TCC GGA AAA ATT GGT 2 91 

959 lie Thr Gly Val Ala Ala Leu Asp Ser Asn Val Ser Gly Lys lie Gly 

960 80 85 90 
961 

962 CTG CGC GCT GTC GTG TAT TAT TTC TGT ACC ACT CTC ATT GCT GTT ATT 33 9 

963 Leu Arg Ala Val Val Tyr Tyr Phe Cys Thr Thr Leu lie Ala Val lie 

964 95 100 105 
965 

966 CTA GGT ATT GTG CTG GTG GTG AGC ATC AAG CCT GGT GTC ACC CAG AAA 387 

967 Leu Gly lie Val Leu Val Val Ser He Lys Pro Gly Val Thr Gin Lys 

968 110 115 120 
969 
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970 GTG GGT 

971 Val Gly 

972 125 
973 

974 GAT GCC 

975 Asp Ala 
976 

977 

978 CAG GCC 

979 Gin Ala 
980 

981 

982 CCC AGC 

983 Pro Ser 
984 

985 

986 ATG ACA 
98 7 Met Thr 
988 190 
989 

990 GGC ATG 

991 Gly Met 

992 205 
993 

994 CTT GTC 

995 Leu Val 
996 

997 

998 CTG GTG 

999 Leu Val 
1000 

1001 

1002 CAG ATC 

1003 Gin He 
1004 

1005 

1006 GGG AAG 

1007 Gly Lys 

1008 270 
1009 

1010 CTT TAC 

1011 Leu Tyr 

1012 285 
1013 

1014 ATT CTC 

1015 He Leu 
1016 

1017 

1018 TTT GCC 

1019 Phe Ala 
1020 



GAA ATT GCG AGG 
Glu He Ala Arg 
130 

ATG TTA GAT CTC 
Met Leu Asp Leu 
145 

TGT TTT CAG CAG 
Cys Phe Gin Gin 
160 

GAT CCA GAG ATG 
Asp Pro Glu Met 
175 

ACT GCA ATT TCC 
Thr Ala lie Ser 



TAT TCA GAT GGC 
Tyr Ser Asp Gly 
210 

TTT GGA CTT GTC 
Phe Gly Leu Val 
225 

GAT TTC TTC AAT 
Asp Phe Phe Asn 
240 

ATC ATG TGT TAT 
lie Met Cys Tyr 
255 

ATC ATA GAA GTT 
He He Glu Val 



ATG GCC ACA GTC 
Met Ala Thr Val 
290 

CCG CTG ATA TAT 
Pro Leu He Tyr 
305 

ATG GGA ATG GCC 
Met Gly Met Ala 
320 



ACA GGC AGC ACC 
Thr Gly Ser Thr 



ATC AGG AAT ATG 
He Arg Asn Met 
150 

TAC AAA ACT AAG 
Tyr Lys Thr Lys 
165 

AAC ATG ACA GAA 
Asn Met Thr Glu 
180 

AAG AAC AAA ACA 
Lys Asn Lys Thr 
195 

ATA AAC GTC CTG 
He Asn Val Leu 



ATT GGA AAA ATG 
He Gly Lys Met 
230 

GCT TTG AGT GAT 
Ala Leu Ser Asp 
245 

ATG CCA CTA GGT 
Met Pro Leu Gly 
260 

GAA GAC TGG GAA 
Glu Asp Trp Glu 
275 

CTG ACT GGG CTT 
Leu Thr Gly Leu 



TTC ATA GTC GTA 
Phe He Val Val 
310 

CAG GCT CTC CTG 
Gin Ala Leu Leu 
325 



CCT GAA GTC AGT 
Pro Glu Val Ser 
135 

TTC CCT GAG AAT 
Phe Pro Glu Asn 



CGT GAA GAA GTG 
Arg Glu Glu Val 
170 

GAG TCC TTC ACA 
Glu Ser Phe Thr 
185 

AAG GAA TAC AAA 
Lys Glu Tyr Lys 
200 

GGC TTG ATT GTC 
Gly Leu He Val 
215 

GGA GAA AAG GGA 
Gly Glu Lys Gly 



GCA ACC ATG AAA 
Ala Thr Met Lys 
250 

ATT TTG TTC CTG 
He Leu Phe Leu 
265 

ATA TTC CGC AAG 
He Phe Arg Lys 
280 

GCA ATC CAC TCC 
Ala He His Ser 
295 

CGA AAG AAC CCT 
Arg Lys Asn Pro 



ACA GCT CTC ATG 
Thr Ala Leu Met 
330 



ACG GTG 435 
Thr Val 
140 

CTT GTC 4 83 

Leu Val 

155 

AAG CCT 531 
Lys Pro 



GCT GTC 579 
Ala Val 



ATT GTT 62 7 

He Val 



TTT TGC 6 75 

Phe Cys 
220 

CAA ATT 723 

Gin He 

235 

ATC GTT 771 
He Val 



ATT GCT 819 
He Ala 



CTG GGC 86 7 

Leu Gly 



ATT GTA 915 
He Val 
300 

TTC CGA 963 

Phe Arg 

315 

ATC TCT 1011 
He Ser 
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1021 

1022 TCC AGT TCA GCA ACA CTG CCT GTC ACC TTC CGC TGT GCT GAA GAA AAT 1059 

1023 Ser Ser Ser Ala Thr Leu Pro Val Thr Phe Arg Cys Ala Glu Glu Asn 

1024 335 340 345 
1025 

1026 AAC CAG GTG GAC AAG AGG ATC ACT CGA TTC GTG TTA CCC GTT GGT GCA 1107 

102 7 Asn Gin Val Asp Lys Arg lie Thr Arg Phe Val Leu Pro Val Gly Ala 
1028 350 355 360 

1029 

1030 ACA ATC AAC ATG GAT GGG ACC GCG CTC TAT GAA GCA GTG GCA GCG GTG 1155 

1031 Thr lie Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Val Ala Ala Val 

1032 365 370 375 380 
1033 

1034 TTT ATT GCA CAG TTG AAT GAC CTG GAC TTG GGC ATT GGG CAG ATC ATC 1203 

103 5 Phe lie Ala Gin Leu Asn Asp Leu Asp Leu Gly lie Gly Gin lie lie 
1036 385 390 395 
1037 

103 8 ACC ATC AGT ATC ACG GCC ACA TCT GCC AGC ATC GGA GCT GCT GGC GTG 1251 

1039 Thr lie Ser He Thr Ala Thr Ser Ala Ser He Gly Ala Ala Gly Val 

1040 400 405 410 
1041 

1042 CCC CAG GCT GGC CTG GTG ACC ATG GTG ATT GTG CTG AGT GCC GTG GGC 12 99 

1043 Pro Gin Ala Gly Leu Val Thr Met Val He Val Leu Ser Ala Val Gly 

1044 415 420 425 
1045 

1046 CTG CCC GCC GAG GAT GTC ACC CTG ATC ATT GCT GTC GAC TGG CTC CTG 134 7 

1047 Leu Pro Ala Glu Asp Val Thr Leu He He Ala Val Asp Trp Leu Leu 

1048 430 435 440 
1049 

1050 GAC CGG TTC AGG ACC ATG GTC AAC GTC CTT GGT GAT GCT TTT GGG ACG 13 95 

1051 Asp Arg Phe Arg Thr Met Val Asn Val Leu Gly Asp Ala Phe Gly Thr 

1052 445 450 455 460 
1053 

1054 GGC ATT GTG GAA AAG CTC TCC AAG AAG GAG CTG GAG CAG ATG GAT GTT 1443 

1055 Gly He Val Glu Lys Leu Ser Lys Lys Glu Leu Glu Gin Met Asp Val 

1056 465 470 475 
1057 

1058 TCA TCT GAA GTC AAC ATT GTG AAT CCC TTT GCC TTG GAA TCC ACA ATC 1491 

1059 Ser Ser Glu Val Asn He Val Asn Pro Phe Ala Leu Glu Ser Thr He 

1060 480 485 490 
1061 

1062 CTT GAC AAC GAA GAC TCA GAC ACC AAG AAG TCT TAT GTC AAT GGA GGC 153 9 

1063 Leu Asp Asn Glu Asp Ser Asp Thr Lys Lys Ser Tyr Val Asn Gly Gly 

1064 495 500 505 
1065 

1066 TTT GCA GTA GAC AAG TCT GAC ACC ATC TCA TTC ACC CAG ACC TCA CAG 1587 

1067 Phe Ala Val Asp Lys Ser Asp Thr He Ser Phe Thr Gin Thr Ser Gin 

1068 510 515 520 
1069 

1070 TTC TAGGGCCCCT GGCTGCAGAT GACTGGAAAC AAGGAAGGAC ATTTCGTGAG 1640 

1071 Phe 
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1072 525 
1073 

1074 AGTCATCTCA AACACGGCTT AAGGAAAAGA GAAA 1674 

1075 

1076 

1077 (2) INFORMATION FOR SEQ ID NO : 9 : 
1078 

1079 (i) SEQUENCE CHARACTERISTICS: 

1080 (A) LENGTH: 525 amino acids 

1081 (B) TYPE: amino acid 

1082 (D) TOPOLOGY: linear 
1083 

1084 (ii) MOLECULE TYPE: protein 

1085 

1086 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

1087 

1088 Met Gly Lys Pro Ala Arg Lys Gly Cys Pro Ser Trp Lys Arg Phe Leu 

1089 1 5 10 15 
1090 

1091 Lys Asn Asn Trp Val Leu Leu Ser Thr Val Ala Ala Val Val Leu Gly 

1092 20 25 30 
1093 

1094 lie Thr Thr Gly Val Leu Val Arg Glu His Ser Asn Leu Ser Thr Leu 

1095 35 40 45 
1096 

1097 Glu Lys Phe Tyr Phe Ala Phe Pro Gly Glu lie Leu Met Arg Met Leu 

1098 50 55 60 
1099 

1100 Lys Leu lie lie Leu Pro Leu lie lie Ser Ser Met lie Thr Gly Val 

1101 65 70 75 80 
1102 

1103 Ala Ala Leu Asp Ser Asn Val Ser Gly Lys lie Gly Leu Arg Ala Val 

1104 85 90 95 
1105 

1106 Val Tyr Tyr Phe Cys Thr Thr Leu He Ala Val He Leu Gly He Val 

1107 100 105 110 
1108 

1109 Leu Val Val Ser He Lys Pro Gly Val Thr Gin Lys Val Gly Glu He 

1110 115 120 125 
1111 

1112 Ala Arg Thr Gly Ser Thr Pro Glu Val Ser Thr Val Asp Ala Met Leu 

1113 130 135 140 
1114 

1115 Asp Leu He Arg Asn Met Phe Pro Glu Asn Leu Val Gin Ala Cys Phe 

1116 145 150 155 160 
1117 

1118 Gin Gin Tyr Lys Thr Lys Arg Glu Glu Val Lys Pro Pro Ser Asp Pro 

1119 165 170 175 
1120 

1121 Glu Met Asn Met Thr Glu Glu Ser Phe Thr Ala Val Met Thr Thr Ala 

1122 180 185 190 
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1123 
1124 
1125 
1126 
1127 
1128 
1129 
1130 
1131 
1132 
1133 
1134 
1135 
1136 
1137 
1138 
1139 
1140 
1141 
1142 
1143 
1144 
1145 
1146 
1147 
1148 
1149 
1150 
1151 
1152 
1153 
1154 
1155 
1156 
1157 
1158 
1159 
1160 
1161 
1162 
1163 
1164 
1165 
1166 
1167 
1168 
1169 
1170 
1171 
1172 
1173 



lie Ser Lys Asn Lys Thr Lys Glu Tyr Lys lie Val Gly Met Tyr Ser 
195 200 205 

Asp Gly lie Asn Val Leu Gly Leu lie Val Phe Cys Leu Val Phe Gly 
210 215 220 

Leu Val lie Gly Lys Met Gly Glu Lys Gly Gin lie Leu Val Asp Phe 

225 230 235 240 



Phe Asn Ala 



Cys Tyr Met 



Glu Val Glu 
275 

Thr Val Leu 
290 

lie Tyr Phe 
305 

Met Ala Gin 



Thr Leu Pro 



Lys Arg lie 
355 

Asp Gly Thr 
370 

Leu Asn Asp 
385 

Thr Ala Thr 



Leu Val Thr 



Asp Val Thr 
435 



Leu Ser 
245 

Pro Leu 
260 



Asp Ala 
Gly He 



Asp Trp Glu lie 
Thr Gly 
He Val 



Thr Met Lys 
250 

Leu Phe Leu 
265 

Phe Arg Lys 
280 



lie Val Gin He 



lie Ala Gly Lys 
270 



He Met 
255 

He He 



Leu Gly Leu Tyr Met Ala 
285 



Leu Ala 
295 



He His Ser He Val He Leu Pro Leu 
300 



Ala Leu 
325 

Val Thr 
340 



Val Arg 
310 

Leu Thr 



Phe Arg 



Thr Arg Phe Val 



Lys Asn Pro 



Ala Leu Met 
330 

Cys Ala Glu 
345 

Leu Pro Val 
360 



Phe Arg Phe Ala 
315 

He Ser Ser Ser 



Met Gly 
320 

Ser Ala 
335 



Glu Asn Asn Gin Val Asp 
350 

Gly Ala Thr He Asn Met 
365 



Ala Leu 



Leu Asp 



Ser Ala 
405 

Met Val 
420 



Tyr Glu 
375 

Leu Gly 
390 

Ser He 



He Val 



Ala Val Ala Ala Val Phe He Ala Gin 
380 



Leu He He Ala 



He Gly Gin 



Gly Ala Ala 
410 

Leu Ser Ala 
425 

Val Asp Trp 
440 



He He Thr He 
395 

Gly Val Pro Gin 



Ser He 
400 

Ala Gly 
415 



Val Gly Leu Pro Ala Glu 
430 

Leu Leu Asp Arg Phe Arg 
445 



Thr Met Val Asn Val Leu Gly Asp Ala Phe Gly Thr Gly He Val Glu 
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1174 450 455 460 

1175 

1176 Lys Leu Ser Lys Lys Glu Leu Glu Gin Met Asp Val Ser Ser Glu Val 

1177 465 470 475 480 
1178 

1179 Asn lie Val Asn Pro Phe Ala Leu Glu Ser Thr lie Leu Asp Asn Glu 

1180 485 490 495 
1181 

1182 Asp Ser Asp Thr Lys Lys Ser Tyr Val Asn Gly Gly Phe Ala Val Asp 

1183 500 505 510 
1184 

1185 

1186 Lys Ser Asp Thr lie Ser Phe Thr Gin Thr Ser Gin Phe 

1187 515 520 525 
1188 

1189 

1190 (2) INFORMATION FOR SEQ ID NO: 10: 
1191 

1192 (i) SEQUENCE CHARACTERISTICS: 

1193 (A) LENGTH: 28 base pairs 

1194 (B) TYPE: nucleic acid 

1195 (C) STRANDEDNESS : single 

1196 (D) TOPOLOGY: linear 
1197 

1198 (ii) MOLECtJLE TYPE: DNA (genomic) 

1199 

1200 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

1201 

12 02 CGCGGGTACC GCCATGGAGA AGAGCAAC 28 
1203 

12 04 (2) INFORMATION FOR SEQ ID NO: 11: 
1205 

1206 (i) SEQUENCE CHARACTERISTICS: 

1207 (A) LENGTH: 29 base pairs 

1208 (B) TYPE: nucleic acid 

1209 (C) STRANDEDNESS: single 

1210 (D) TOPOLOGY: linear 
1211 

1212 (ii) MOLECULE TYPE: DNA (genomic) 

1213 

1214 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

1215 

1216 CGCGTCTAGA TCACAGAACC GACTCCTTG 2 9 

1217 

1218 (2) INFORMATION FOR SEQ ID NO: 12: 
1219 

1220 (i) SEQUENCE CHARACTERISTICS: 

1221 (A) LENGTH: 29 base pairs 

1222 (B) TYPE: nucleic acid 

1223 (C) STRANDEDNESS: single 

1224 (D) TOPOLOGY: linear 
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1225 

122 6 (ii) MOLECULE TYPE: DNA (genomic) 
1227 

1228 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

1229 

123 0 CGCGGGTACC AATATGACTA AAAGCAATG 2 9 
1231 

1232 (2) INFORMATION FOR SEQ ID NO: 13: 
1233 

1234 (i) SEQUENCE CHARACTERISTICS: 

123 5 (A) LENGTH: 2 9 base pairs 

1236 (B) TYPE: nucleic acid 

1237 (C) STRANDEDNESS : single 

123 8 (D) TOPOLOGY: linear 
1239 

124 0 (ii) MOLECULE TYPE: DNA (genomic) 
1241 

1242 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

1243 

1244 CGCGTCTAGA CTACATCTTG GTTTCACTG 2 9 

1245 

1246 (2) INFORMATION FOR SEQ ID NO: 14: 
1247 

1248 (i) SEQUENCE CHARACTERISTICS: 

124 9 (A) LENGTH: 2 9 base pairs 

1250 (B) TYPE: nucleic acid 

1251 (C) STRANDEDNESS: single 

1252 (D) TOPOLOGY: linear 
1253 

1254 (ii) MOLECULE TYPE: DNA (genomic) 

1255 

12 56 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

1257 

12 58 CGCGGGTACC ACCATGGCAT CTACGGAAG 2 9 

1259 

1260 (2) INFORMATION FOR SEQ ID NO: 15: 
1261 

1262 (i) SEQUENCE CHARACTERISTICS: 

1263 (A) LENGTH: 30 base pairs 

1264 (B) TYPE: nucleic acid 

1265 (C) STRANDEDNESS: single 

1266 (D) TOPOLOGY: linear 
1267 

1268 (ii) MOLECULE TYPE: DNA (genomic) 

1269 

1270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

1271 

1272 CGCGTCTAGA TTATTTCTCA CGTTTCCAAG 3 0 

1273 

12 74 (2) INFORMATION FOR SEQ ID NO : 16 : 
1275 
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1276 (i) SEQUENCE CHARACTERISTICS: 

1277 (A) LENGTH: 2 8 base pairs 

1278 (B) TYPE: nucleic acid 

1279 (C) STRANDEDNESS : single 

1280 (D) TOPOLOGY: linear 
1281 

12 82 (ii) MOLECULE TYPE: DNA (genomic) 
1283 

1284 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 
1285 

1286 CGCGGGTACC GCCATGGGGA AACCGGCG 28 
1287 

1288 (2) INFORMATION FOR SEQ ID NO: 17: 
1289 

1290 (i) SEQUENCE CHARACTERISTICS: 

1291 (A) LENGTH: 28 base pairs 

1292 (B) TYPE: nucleic acid 

1293 (C) STRANDEDNESS: single 
12 94 (D) TOPOLOGY: linear 
1295 

12 96 (ii) MOLECULE TYPE: DNA (genomic) 
1297 

12 98 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
1299 

1300 CGCGGGATCC CTAGAACTGT GAGGTCTG 2 8 

1301 
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IT 
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